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SEQUENCE LISTING 



(1) GENERAL 



(i) APPLICANT: } 

(A) NAME; \N ovo Nordisk A/S 

(B) STREEH: Novo Alle 

(C) CHY: Bagsvaerd 

(E) COUNTRx: Denmark 

(F) POSTAL \ CODE (ZIP): 2880 

(G) TEIZFHQNE: +45 4444 8888 

(H) TELEFAX +45 4449 3256 

(I) TELEX: S7304 



(ii) TITLE OF 
Form 

(iii) NUMBER OF 

(iv) COMPUTER 

(A) MEDIUM 

(B) CCMTOTER: 

(C) OPERATING 

(D) SOFTWARE: 



[ON: Human Spasmolytic Polypeptide in Glycosylated 



tfCES: 14 
FORM: 

Floppy disk 
PC cxmpatible 
if STEM: PC-DOS/MS-DOS 

atentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ lip NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IENCTH: 106 Wmino adds 

(B) TXPE: amino acid 

(C) STRANDEDNESS^ single 

(D) TOPOLOGY: li 

(ii) MOLECULE THEE: prolfcein 

(iii) HYPOIHEITCAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo skpiens 



(xi) SEQUENCE DESCRIPTION: 



ID NO: 1: 



Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
1 5 I 10 15 

Ihr Asn Cys Gly Phe Pro Gly\Ile Thr Ser Asp Gin Cys Hie Asp Asn 
20 25 30 
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Gly Cys Cys Hie Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
35 40 45 

Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
50 55 60 

Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
65 70 75 80 

Arg Lys cys cys Phe Ser Asn Phe lie Phe Glu Val Pro Trp Cys Phe 
85 90 95 

Phe Pro Asn Ser Val Glu Asp Cys His Tyr 
100 105 

(2) INFORMATION FOR SBQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IENG0H: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) EEA3URE: 

(A) NAME/KEY: CDS 

(B) IOCATICN: 77.. 553 

(ix) FEATURE: 

(A) NAME/KEY: sig_j?eptide 

(B) LXATICN: 77.. 235 

(ix) EEAIURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 236.. 553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAATTCCAIT CAAGAATAGT TCAAACAAGA AGATTACAAA CIATCAA3TT CATACACAAT 60 

ATAAAOGACC AAAAGA ATG AAG GCT GTT TIC TTG GIT TIG TCC TIG ATC 109 

Met Lys Ala Val Phe Leu Val Leu Ser Leu He 
-53 -50 -45 

GGA TTC TCC TCG GCC CAA CCA GTC ACT GGC GAT GAA TCA TCP GTT GAG 157 
Gly Phe cys Trp Ala Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu 
-40 -35 -30 



WO 94/17102 PCT/DK94/00037 

30 

ATT COG GAA GAG TCT CTG ATC ATC GCT GAA AAC ACC ACT TIG GCT AAC 205 
He Pro Glu Glu Ser Leu He He Ala Glu Asn Thr Thr Leu Ala Asn 
-25 -20 -15 

GTC GCC ATC GCT GAG AGA TIG GAG AAG AGA GAG AAA CCC TCC CCC TGC 253 
Val Ala Met Ala Glu Arg Leu Glu Lys Arg Glu Lys Pro Ser Pro Cys 
-10 -5 1 5 

CAG TGC TCC AGG CTG AGC CCC CAT AAC AGG AOG AAC TGC GGC TTC CCT 301 
Gin Cys Ser Arg Leu Ser Pro His Asn Arg Thr Asn Cys Gly Phe Pro 
10 15 20 

GGA ATC ACC AGT GAC CAG TGT TIT GAC AAT GGA TGC TCT TTC GAC TCC 349 
Gly lie Thr Ser Asp Gin Cys Phe Asp Asn Gly Cys cys Phe Asp Ser 
25 30 35 

AGT GTC ACT GGG GTC CCC TGG TGT TTC CAC CCC CTC OCA AAG CAA GAG 397 
Ser Val Thr Gly Val Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu 
40 45 50 

TOG (»T CAG TGC GTC ATG GAG GTC TCA GAC AGA AGA AAC TGT GGC TAG 445 
Ser Asp Gin Cys Val Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr 
55 60 65 70 

COG GGC ATC AGC CCC GAG GAA TGC GCC TCT GGG AAG TGC TGC TTC TCC 493 
Pro Gly He Ser Pro Glu Glu Cys Ala Ser Arg lys Cys cys Phe Ser 
75 80 85 

AAC TTC ATC TTT GAA GIG CCA TGG TGC TTC TTC COG AAC TCT GIG GAA 541 
Asn Phe lie Phe Glu Val Pro Trp Cys Phe Phe Pro Asn Ser Val Glu 
90 95 100 

GAC TGC CAT TAC TAAGTCTAGA 563 

, Asp cys His Tyr 

105 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Ala Val Phe Leu Val Leu Ser Leu lie Gly Phe Cys Trp Ala 
-53 -50 -45 -40 

Gin Pro Val Thr Gly Asp Glu Ser Ser Val Glu lie Pro Glu Glu Ser 
-35 -30 -25 



Leu lie lie Ala Glu Asn Thr Thr Leu Ala Asn Val Ala Met . Ala Glu 
-20 -15 -10 
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Ara Leu Glu Lys Arg Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu 
-5 1 5 10 

Ser Pro His Asn Arg Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp 
15 20 25 

Gin Cys Phe Asp Asn Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val 
30 35 40 

Pro Trp Cys Phe His Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val 
45 50 55 

Met Glu Val Ser Asp Arg Arg Asn Cys Gly Tyr Pro Gly He Ser Pro 
60 65 70 75 

Glu Glu Cys Ala Ser Arg Lys cys cys Hie Ser Asn Phe lie Phe Glu 

80 85 90 

Val Pro Trp Cys Phe Phe Pro Asn Ser Val Glu Asp Cys His Tyr 
95 100 105 

(2) INFOFMATION FOR SEQ ID NO; 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NCTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA . 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

. (A) ORGANISM: synthetic 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 4: 
GGCTGAGCCC OCATAACAG 19 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGGAAACACC AGGGGAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAGAAACCCT CCCCCTGQCA GTGCICCAGG C 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CENA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCAGOCTGGA GCACTCGCAG GGGGAGGGTT TCTC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: * NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCIGAGAGAT TGGAGAAGAG AGAGAAACCC TCCCCCT 37 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ 3D NO: 9: 
TGOCTCATGG AGCTCTC 17 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IENGIH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGCACCATGG CACTTCAAAG 20 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



# 



# 
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(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CTCOOCTGCT GTITOCACCC CCTCCCAAAG CAAGAGTOGG ATCAGTCOGT CATGGAGGTC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOIOGY: linear 

(ii) MDIECUIE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGAGACCTOC ATGAOGCACT GATCCGACTC TTGCTTTCGG AGGGGGTGGA AACACCAGGG 60 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: synthetic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CATGGTGCIT CTTCCOGAAC TCTCTGGAAG ACIGCCATTA CTAAGT 46 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 base pairs 
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(B) TYfeE: nucleic acid 

(C) STW^NDECNESS: single 

(D) TOK>IOGY: linear 



(ii) KDLECUUE 

(iii) HYPOIHETI 

(vi) ORIGINAL 
(A) 




: .synthetic 



(xi) SEQUENCE EES 
CIBGACXTAG TAATCGCAGT 




# 



ION: SEQ ID NO: 14: 

GTTOGGGAAG AAGCAC 



46 



